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A  multivariate  normal  linear  model  in  its  canonical  form  consists  of  a 
data  matrix  X  =  (Xj  ,...,3^ ):p  *  k  with  independent  columns  X^  ~  »D » 

i  -  and  an  independent  p  *  p  Wishart  matrix  S  ~  W  (n,I).  Here 

f^’s  and  I  are  unknown.  This  kind  of  data  arises,  for  example,  if  we  are 
sampling  from  k  different  multivariate  normal  populations  I) t 

i  =  l,...,k,  with  a  sample  of  size  n^  from  the  ith  population.  If  and 
denote  respectively  the  sample  mean  vector  and  the  sample  sum  of 
squares  and  products  matrix,  we  may  write  X^  =  J  n^  X_.  ,  i  =  l,...,k,  and 
S  =  S  +...+  S,  to  get  the  above  setup  where  n=n  +...+  n,  -  k  and 

1  y _  "  1  K 

=  ■/  rn  i  =  l,...,k.  Under  this  formulation  we  consider  the  problems 

of  estimating  I,  the  common  variance-covariance  matrix,  X  '  ,  the  common 
precision  matrix,  and  |Xl,  the  common  generalized  variance.  We  assume  that 
n  *  p  +  1. 


From  the  classical  point  of  view  without  any  decision-theoretic 
consideration,  one  would  simply  ignore  X  (since  f  ^’s  are  unknown)  and 
propose  multiples  of  S,  S  '  and  |s|  which  are  unbiased  for  X,  X  '  and  |  X I 
respectively.  Even  from  a  decision-theoretic  point  of  view  with  the 
availability  of  a  suitable  loss  function  for  each  of  these  problems,  one  would 
normally  be  tempted  to  use  very  simple  estimates  like  best  multiples  of  S,  S_I 
and  |S  |  especially  for  invaiant  loss  functions.  To  illustrate  this  point 
clearly,  let  us  first  write  down  the  various  loss  functions  which  are  commonly 
used  in  this  context.  We  denote  by  L(X,X)  the  loss  is  estimating  X  by  X-  A 
similar  notation  is  used  for  L(X-,iI-1)  and  L(|xl>lxl)> 

(1.1)  L ,(X,X)  =  tr(X  -  I)' 

L2d,l)  =  tr(X  X~‘  -  I)2 
L3(X,X)  =  tr  x  l  -  <n!i  X-,l  -  P 

L  (X , X I Q)  =  .X  .  (*■  i  -  V  . ) 2q .  .,  q  *  0  (weights) 


(1.2) 


P 
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\d",r)  =  tr(E_I  -  E~V 

L2(r\rl)  =  tr<r‘z  -  n2 

L3(r*,r‘)  =  tr  Pi  -  lnlPZl  -  P 

L4(E  1 | Q )  =  tr(E  1  -  E  *)2Qi  Q  arbitrary  p.d.  matrix 
(1.3)  L2 ( | E| ,  III )  =  ( lEl  -  I E I ) 2 

L2(|El,  lEl)  =  (lEl/lEl  -  l)a 
l3(IeI,  IeI)  -  ( IeI/IeI )  -  in( i E I / 1 e I )  -  i 
wIeI,  IeI)  =  (IeI/IeI  -  i)a 
ls( IeI ,  IeI )  =  (IeI/IeI)  -  in( IeI/IiI ;  -  i 

The  loss  function  L,(-)  in  (1.1)  and  (1.2)  is  a  generalization  of  squared  error 
loss  and  is  noninvariant.  The  loss  function  L4(")  in  (1.1),  due  to  Perlman 
(1972),  is  also  noninvariant  and  is  a  generalization  of  squared  error  loss  with 
constant  weights  qjj’s.  The  loss  functions  La(‘)  and  L3(-)  in  (1.1)  and  (1.2) 
are  invariant,  L3(-)  in  (1.1)  being  the  entropy  loss  introduced  in  James  and 
Stein  (1961)  while  La(-)  in  (1.1)  is  proposed  in  Selliah  (1964).  L3(‘)  in  (1.2)  is 

introduced  and  justified  in  Sinha  and  Ghosh  (1986).  All  the  four  loss 
functions  in  (1.2)  are  completely  analogous  to  those  in  (1.1).  L«(*)  in  (1.2) 

appears  in  Haff  (1979a).  The  loss  functions  in  (1.3)  are  all  invariant,  L ,  (• ) 
and  L3(‘)  being  equivalent  as  far  as  risk  is  concerned.  We  mention  in  passing 
that  the  loss  functions  L t  (* ) ,  La(’)  and  L3(*)  in  (l.l)  and  (1.2)  can  be  used 
interchangeably.  For  example,  La(E-1iE~')  in  (1.2)  can  also  bo  used  as  a 
genuine  loss  function  for  I  because  it  satisfies  La(E",,E-1)  *  0  and  '=  0'  if  and 
only  if  -  E*  [Lobb  functions  developed  by  Efron  and  Morris  (1976) 

from  an  empirical  Bayes  argument  are  not  considered  here.] 

When  a  loss  function  is  invariant,  an  affine  (translation  and  multiplication 
by  nonsingular  matrices)  equivariant  estimator  turns  out  to  be  of  the  form  cS 
for  I,  dS_l  for  E_l  and  e|s|  for  |El  where  c,  d  and  e  are  constants.  These 


L 
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estimators  ignore  X  and,  due  to  the  invariant  nature  of  the  loss  function,  have 
constant  risks,  independent  of  £.  It  is  then  possible  to  find  the  best  choice 
of  these  constants,  resulting  in  the  best  affine  equivariant  estimators.  For 
example,  it  is  easy  to  show  that  (assuming  that  n  *  p  +  4) 

(1.4)  c%=  n+p+1  ,c3  =  ^  ,  d^  =  (n-p) (n-p-3)/ (n-1) ,  n-p-1, 

ei  r  ea  :  e^  =  (n-p)!/n!,  e^  =  (n-p-4) /(n-4) !  , 
es  =  (n-p-2) ! / (n-2) ! 

where  c2  is  the  best  choice  of  the  constant  c  for  the  loss  function  L2  (• )  in 
(1.1)  and  so  on. 

The  question  then  arises  whether  one  should  be  content  with  these 
simple  estimators  or  look  for  improved  estimators.  For  a  decision-theorist,  it 
is,  of  course,  essential  to  know  if  these  best  multiples  of  S,  S~*  and  |  s|  are 
admissible  for  the  respective  problems.  Unfortunately,  it  turns  out  none  of 
these  estimators  is  admissible  and,  in  fact,  there  are  many  ways  to  improve 
over  them.  The  improved  estimators,  however,  lack  simplicity  and,  unlike  for 
the  above  simple  estimators,  their  frequency  properties  are  extremely  difficult 
to  study  analytically.  Numerical  computations  (Lin  and  Perlman  (1985),  Dey 
and  Srinivasan  (1984),  Sinha  and  Ghosh  (1986))  show  that  risk  improvements 
are  marginal  in  many  cases  but  substantial  in  some  cases. 

The  various  methods  leading  to  improved  estimators  of  X,  X-1  and  |X|  are 
reviewed  in  this  paper.  Broadly  speaking,  there  are  three  methods:  (i) 

minimax  estimators  due  to  James  and  Stein  (1961)  and  recently  modified  and 
improved  by  Dey  and  Srinivasan  (1985);  (ii)  empirical  Bayes  estimators 
developed  by  Haff  (1977,  1979a,  1979b,  1980);  and  (hi)  Stein’s  testimators  (Stein 
(1964))  as  developed  by  Shorrock  and  Zidek  (1976),  Sinha  (1976),  and  Sinha 
and  Ghosh  (1986). 

Some  open  problems  are  mentioned  in  the  concluding  section. 


2.  Minimax  Estimators. 
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To  improve  over  the  best  affine  equivariant  estimators,  James  and  Stein 

(1961)  considered  a  somewhat  smaller  group  of  translation  and  multiplication 

by  p  *  p  lower  triangular  matrices  with  positive  diagonal  elements.  The 

component  translation  of  this  group  eliminates  X  and  the  estimators  which  are 

equivariant  under  the  triangular  group  G+  of  the  above  type  turn  out  to  be 

of  the  form  KaK’  where  K  t  GJp  satisfies  S  =  KK*  and  *  is  a  diagonal  matrix. 

For  an  invariant  loss  function,  the  risk  of  such  an  estimator  is  independent 

of  £  and  depends  only  on  the  diagonal  elements  of  *  (apart  from  n  and  p). 

To  compute  the  risk,  one  has  to  use  the  simple  fact  that  if  S  Wp(n,I)  and  S 

=  KK’  with  K  e  Q+,  then  K.2.  X2  .  ,  K. .  ~  N(0,1),  j  ^  i,  and  all  variables 
T  11  n-i+i  ji 

in  K  are  independent  (see  Kshirsagar  (1978)).  It  is  then  possible  to  find  the 
best  choice  of  the  diagonal  elements  of  a,  thus  resulting  in  the  best  G,+- 
equivariant  estimator  which,  if  different  from  the  best  affine  equivariant 
estimator,  is  certainly  an  improvement  over  it.  Moreover,  because  of  the 

solvability  of  the  group  G+T  (Kiefer  (1957)),  the  resultant  estimator  with 
constant  risk  is  automatically  minimax.  This  procedure  can  be  followed  to  get 
improved  estimators  for  the  invariant  loss  functions  L2(')  and  L3  (• )  in  (1.1) 
and  (1.2).  For  example,  for  the  loss  function  L2(’)  in  (1.1),  the  minimax 
estimator  dominating  the  best  affine  equivariant  estimator  c2S  is  given  by 
Ka0K*  where  the  diagonal  elements  <5,,...,<5p  of  a0  are  the  solutions  of  the 
equations 

(2.1)  (n+p-1 )  (n+p+l)<5 ^  +  (n+p-3)<5^  +...+  (n-p+l)<5^  =  n  +  p  -  1 

(n+p-S)^  +  (n+p-3)  (n+p-l)«5a  +...+  (n-p+l)<5^  =  n+p-3 


(n-p+l)<5  +  (n-p+l)<5  +. .  ,+(n-p+l)(n-p+3)<5  =  n-p+1 

12  P 

Similarly,  the  minimax  estimator  dominating  the  best  affine  equivariant 
estimator  c3S  for  the  loss  function  L3(’)  in  (1.1)  is  given  by  Ka,  K*  with  the 
ith  diagonal  element  i[  of  a,  being  equal  to  (n+p-2i+ 1 )“ 1  ,i  =  l,...,p.  Analogous 
results  can  be  obtained  for  the  loss  functions  L2(’)  in  L3(')  in  (1.2). 
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Romurk  2.1.  The  above  procedure  does  not  work  for  the  loss  functions  in 
(1.3)  relevant  for  estimating  the  generalized  variance  because  |k^k’|  =  e|s| 
for  some  e,  which  is  also  affine  equivariant.  A  method  for  improving  over 
e  |  S  |  is  discussed  in  Section  4. 

Remark  2.2.  A  drawback  of  the  minimax  estimators  presented  above  is  their 

dependence  on  the  coordinate  system.  Although  this  can  be  overcome  by 

averaging  the  minimax  estimator  over  the  group  of  p  x  p  orthogonal  matrices 

with  respect  to  Haar  measure,  the  resultant  estimator  does  not  have  a  simple 

form  for  p  *  3  (see  Takemura  (1984)).  Another  drawback  is  that  the  amount 

of  risk  improvement  is  very  marginal.  Stein  (1975,  1977a,  1977b)  instead 

considered  the  class  of  orthogonally  invariant  estimators  of  the  form  £  =  R 

♦  (I.)  R  where  S  -  RLR  with  R  the  matrix  of  normalized  eigen  vectors  (RR*  = 

» 

R  R  -  I),  I  -  diag(<(  is  the  diagonal  matrix  of  eigen  values  of  S 

with  i  *  I  ,  and  ♦ t  L  >  =  (♦ ( ( L) L) ) .  For  estimating  £  under 

the  entropy  loss  function  !,,('•}  in  (1.1),  Stein  (1975)  proposed  using 

(2.2)  *'  il.  i  hi  p*  1  '  •  2*.  £  1/(1.  -<  )],  i-l,...,p. 

i  i  i  jXj  i  j 

However,  the  risk  function  of  the  resultant  estimator  is  very  complicated  and 
it  has  not  been  theoretically  determined  that  this  estimator  dominates  c3S,  the 
best  affine  equi variant  estimator  for  this  loss,  although  Monte  Carlo  simulation 
results  of  Lin  and  Perlman  (1985)  indicate  this  to  be  the  case.  Recently,  for 
the  loss  function  !,,(•)  in  (1.1),  Dey  and  Srinivasan  (1985),  by  successfully 
applying  Stein’s  technique,  obtained  orthogonally  invariant  minirnax  estimators 
which  are  improvements  over  c3S  as  well  as  over  the  minimax  estimator 
presented  before  for  p  *  3.  The  improved  estimators  which  are  obtained  by 
solving  a  certain  differential  inequality  have  very  simple  forms.  Also  the 
percentage  risk  improvements  over  the  risk  of  c3S  and  over  the  minimax  risk 
are  both  significant,  except  for  very  large  n  (Dey  and  drivivasan  (1984)).  We 
may  mention  that  Haff’s  (1982)  modification  of  Stein’s  estimator  £  also  performs 
quite  well.  Dey  and  Srinivasan’s  (1985)  improved  estimator  over  c3S  is  given 
by  R  ♦  (L)r’  and  that  over  the  minimax  estimator  K^,  K*  by  R  *1(L)r\  The 
components  of  ♦  (L)  are  obtained  as 
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(2.3)  *°(L)  =  <./n  -  (<.  log  <.)ro(u)/(bo  +  u) ,  i  = 

where  u  =  f  log2l^,  bQ  *  144(p  -  2)J/(25n2)  is  a  constant,  and  TQ(u)  is  a 

function  satisfying  (i)  0  *  r  (u)  *  2(p  -  2)/n*,  n*  =  5n2/6;  (ii)  r  (u) 

O  0 

> 

monotone  nondecreasing  in  u  and  E[tq(u)]  ^  ».  On  the  other  hand,  the 
components  of  ♦  (L)  are  given  by 

(2.4)  *^(L)  =  ^<5*  -  (<i  log  <i)Ti(u)/(bj  +  u),  i  =  l,...,p, 

where  b(  *  144(p  -  2) 2/{25(n+p-l ) 2 }  is  a  constant,  and  (u)  is  a  function 
satisfying  (i)  0  ^  Ti(u)  *  12(p  -  2)/(5(n+p-l) 2 ) ,  (ii)  r(u)  monotone  non¬ 
decreasing  in  u  and  Efr  (u)]  ^  «■.  Incidentally,  another  very  simple 
estimator  R  ♦2(L)R  with  <2(L)  =  4  <5 !. ,  i  =  l,...,p,  is  also  minima*  and  is 

an  improvement  over  both  c3S  and  K*,K  for  the  entropy  loss  function  L3(‘)  in 

(1.1)  (Stein  (1982),  Dey  and  Srivivasan  (1985)). 

3.  Empirical  Bayes  Estimators 

A  unified  approach  to  constructing  estimators  of  E  and  substantially 

better  (in  terms  of  amount  of  risk  improvement)  than  the  best  affine 
equivariant  estimators  for  invariant  loss  functions  in  (1.1)  and  (1.2)  and 
otherwise  better  than  the  unbiased  estimators  has  been  successfully  developed 
in  a  aeries  of  papers  by  Haff  (1977,  1979a,  1979b,  1980).  Assuming  an  Wishart 
conjugate  prior  for  E~ 1 ,  I-1  ~  Wp(n',C",/r),  n’  *  p,  y  ^  0,  C  p.d.,  which 
results  in  the  Wishart  posterior  for  I-1,  I~'|S  ~  Wp(n  +  n’,  (S  +  y  C)-1  ),  it 
turns  out  that  for  most  loss  functions  in  (1.1)  and  (1.2),  the  estimator 
minimizing  the  Bayesian  expected  loss  is  of  the  form  E  -  a(S  +  y  C)  for  some 
constant  a  *  0.  An  empirical  Bayes  estimator  is  then  obtained  by  pretending 
that  y  is  unknown  and  suitably  estimating  it  using  the)  marginal  density  of  S 
which  is  proportional  to  7Pr,,t|s|^n  |s  +  7  Cj  ^n+n  ^ '  2.  Quite  generally, 

for  estimating  E,  estimators  of  the  form 

(3.1)  E  =  a{S  +  ut(u)Ip} 

arc  proposed  where  for  simplicity,  C  is  taken  ns  identity,  a  is  a  positive 
ronstnnt,  u  is  an  average  eigen  value  of  S,  and  t(u)  is  nonnegative,  bounded 


-  8  - 


and  noniricreasing.  On  the  other  hand,  for  estimating  the  precision  matrix 
I-1,  estimators  of  the  form 

(3.2)  P  =  b{s"'  +  v  t(v) I  } 


are  proposed  where  b  *  0  is  a  constant,  v  is  an  average  eigen  value  of  S-1, 

and  t(v)  is  nonnegative,  bounded  and  nonincreasing.  In  applications, 

depending  on  the  loss  function,  u(v)  maybe  the  arithmetic,  geometric  or 

harmonic  mean  of  the  eigen  values  of  S(S-1).  By  (repeatedly)  applying  the 

extremely  powerful  Wishart  Identity  (Stein  (1975),  Haff  (1979b)),  Haff  was  able 

to  construct  estimators  of  the  forms  given  in  (3.1)  and  (3.2)  better  than 

c  .S  and  d  LS"’  for  all  the  loss  functions  in  (1.1)  and  (1.2)  respectively 
opt  opt 

(except  for  the  loss  L 3  ( • )  in  (1.2)  which  is  new).  To  get  a  flavor  of  Haff-type 
improved  estimators,  we  mention  below  a  few  results.  Note  that  the  loss 
function  L ,  ( * )  in  (1.1)  or  (1.2)  is  a  special  case  of  corresponding  L*(-)  when  Q 
is  taken  as  the  identity  matrix.  We  denote  by  £)  and  the  improved 

estimators  due  to  Haff  when  the  loss  function  is  Lj(*)  in  (1.1)  and  (1.2) 
respectively. 

(3.3)  =  a{S  +  u  t(u)Ip) ,  a  =  (n+p+1)  u  =  l/tr(S  ’), 

0  *  t  *  2(p-l)/(n-p+3) ,  t  a  constant,  dominates  c^S. 

(3.4)  =  a{S  +  u  t(u)Ip),  a  =  n  ’,  u  =  l/tr(S  ) , 

0  *  t(u)  *  2(p-l)/n,  an  absolutely  continuous  and 
nonincreasing  function,  dominates  c3S.  (If  t  is  chosen  as 
a  constant,  the  optimal  value  of  t  is  (p-l)/n) 


(3.5)  -  afS  +  u  t(u)I  } ,  u  =  |sj'/p,  t(u)  satisfying 

4  P 

(i)  (4q*/pna)u  t’(u)  +  2q*(a  -  (pn  -  2)/pnJ]t(u)  +  a*t2(u)  *  0 
(ii)  u  t'(u)  +  2t  (u)  ‘  0 

where  q*  =  p(H  q..)'^P/i  q..,  dominates  a  S,  whatever  Q. 
,11  ,11 

(If  t  is  chosen  ns  a  constant,  0  ^  t  ^  2q*( (pn~2)/pnJ-a] /a3 
will  do. ] 


Remark  3.1.  The  result  for  the  loss  function  L,(‘)  in  (1.1)  is  obtained  by 
putting  Q  -  Ip  i.e.,  q*  =  1  in  (3.5).  In  this  case  a  is  usually  taken  as  n_1. 
There  are,  of  course,  simpler  improved  estimators  (improvement  over  n~‘S)  for 
this  loss  function.  Perlman  (1972)  proved  that  P  n-1S  dominates  n~’S  for  any 
P,  (np-2)/(np+2)  *  p  1.  Such  a  simple  dominance  result  holds  for  the  loss 
function  L4(‘)  in  (1.1)  as  well.  It  is  proved  in  Haff  (1979b)  that  a  S  dominates 
n~'S  for  any  a,  (n-l)/(n(n+l))  6  a  z  n-1,  whatever  be  Q.  However,  unlike  for 
the  estimator  in  (3.5),  the  amount  of  risk  improvement  for  these  simple 
estimators  is  marginal. 

Remark  3.2.  The  identity  matrix  Ip  in  (3.3)  and  (3.4)  can  be  replaced  by  any 
p.d.  matrix  C  in  which  case  u  need  be  redefined  as  u  =  l/(trS-,C). 

(3.6)  X2'^  =  b{S“ 1  +  vt(v)Ip},  b  =  (n-p-3)  (n-p)/  (n-1 ) , 

v  =  l/tr(S) ,  0  ^  t(v)  *  2 (p- 1 )/ (n-p) 
and  t’(v)  <  0,  dominates  d2S_1 . 

(3.7)  X4'  =  b{S  '  +  vt  ( v)  Ip} ,  v  =  |  S  |  t(v)  satisfying 

{(4/p) [vt ’ (v)  +  t(v)J  +  2b*t(v)}q* 

+  btJ(v)  *  0,  where  b*  =  b  -  (n-p-1), 


dominates  b  S  ',  whatever  Q. 

Remark  3.3.  The  result  for  the  loss  function  L,(’)  in  (1.2)  is  obtained  from 

(3.7)  by  putting  Q  =  Ip  i.e.,  q*  =  1.  In  this  case  b  is  usually  taken  as 
(n-p-1).  Again,  for  this  loss  function  also,  there  are  simpler  improved 
estimators.  Haff  (1977)  showed  that  for  any  b,  n-p-3  ^  b  n-p-1,  b  S-1 
dominates  (n-p-1 )S_l.  However,  the  amount  of  risk  improvement  is  marginal. 

Reniurk_  3^4.  For  the  loss  function  L4(')  in  (1.2),  Haff  (1979a)  obtained  another 
class  of  more  improved  estimators  of  X-1.  This  is  not  discussed  here. 
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far  no  Haff-type  improved  estimator  is  available  which 
for  estimating  the  generalized  variance  III  under  the  loss 
Also,  no  Haff-type  dominance  result  is  known  for  the  loss 
function  L3(*)  in  (1.2)  which  improves  over  d3S_I  for  estimating  I-1. 
Dominance  results  for  these  problems  are  presented  in  the  next  section. 

4.  Stein's  testimators. 

The  methods  described  in  Sections  2  and  3  above  for  improving  over  the 

best  affine  equivariant  estimators  for  the  loss  functions  in  (1.1)  and  (1.2)  are 

based  on  different  functions  of  S,  but  ignoring  X  all  the  time.  Clearly  these 

methods  do  not  work  when  p  =  1.  This  is  because  for  estimating  the  variance 

cr2  of  a  normal  population  with  unknown  mean  n,  if  the  sample  mean  is  ignored, 

the  best  multiple  of  the  sample  variance  is  admissible  under  both  entropy  loss 

and  squared  error  loss.  Stein  (1964)  came  up  with  an  ingenius  idea  to  use 

the  sample  mean  along  with  the  sample  variance  in  order  to  obtain  an 

improved  estimator  of  a2.  For  the  squared  error  loss,  Stein’s  improved 

-  n  n 

estimator  has  the  form  <r2  =  min[(n+l)  1  I  (yi~y)2,  (n+2)-1  I  (yi-^o)2]  where 

i  i 

/j o  is  any  fixed  constant.  Such  an  estimator  is  also  called  a  testimator  because 

of  its  obvious  interpretation  as  the  result  of  a  suitable  test  procedure. 

Recently,  Stein's  (1964)  technique  has  been  successfully  generalized  in 
the  multivariate  case  to  deal  with  all  the  loss  functions  in  (1.3)  and  the 
entropy  losses  in  (1.1)  and  (1.2).  We  now  have  testimators  which  are  better 
than  e0pt,|s|  for  estimating  | £ | ,  than  c3S  for  estimating  I  when  the  loss  is 
L3(*)  in  (1.1),  and  than  d3S_l  for  estimating  I-1  when  the  loss  is  L3(*)  in 
(1.2).  See  Sinha  (1976),  Sinha  and  Ghosh  (1986),  and  alsc  Shorrock  arid  Zidek 
(1976).  However,  simulation  results  show  that  the  amount  of  risk  improvement 
is  marginal  in  most  cases  except  when  the  loss  is  L3(’)  in  (1.2).  A  brief 
description  of  these  testimators  is  given  below.  We  use  the  notation  X,  S  as 
given  in  the  Introduction. 

For  estimating  III  under  the  loss  function  L,(«)  or  L2(’)  or  L3(*)  in  (1.3), 
the  testimator  better  than  e,|s|  is  given  by  |£j,  =  min{ , 1  |s  +  XX  |t 


Remark  3.5.  So 
dominates  e0p{,  |  S  | 
functions  in  (1.3). 
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| S | }  -  Under  the  loss  functions  L4(-)  and  Ls  ( • )  in  (1.3),  the  improved 
testimators  are  respectively  given  by  |E|4  =  min{ (n-4+10 !  +  \t 

|s| }  „nd  IJI.  =  Is  *  xx'l.  |s|).  On  the  _ 

other  hand,  for  estimating  £  when  the  loss  is  L,(-)  in  (1.1),  the  testimator  I  = 
(S  +  XX’)/(n+k)  if  Xmax(X,S_*X)  *  k/n,  =  S/n  otherwise,  dominates  the  best 
affine  invariant  estimator  S/n.  For  the  other  loss  function  L3(')  in  (1.2),  the 
improved  testimator  I-1  =  (n-p-l+k)(S  +  XX’)-1  if  Xmax(x’s-*  X)  *  k/(n-p-l),  = 
(n-p-l)S~‘  otherwise,  dominates  d3S_l  for  estimating  I-1.  Here  Xmax(')  stands 
for  the  largest  eigen  value  of  (•)• 


A  general  procedure  to  obtain  improved  testimators  is  to  consider 
estimators  equivariant  under  a  nonnormal  subgroup  H  of  the  full  affine  group 
G.  Typically,  the  group  G  acts  on  (X,S)  and  ((,1)  as  X  ■*  AX  +  B,  (  -*  Af  +  B, 

S  -»  ASA*,  £  -»  A£a’  where  A:p  *  p  is  nonsingular  and  B:p  *  k  is  any  matrix. 

Here  (:p  *  k  r  (fi,...,fk).  The  subgroup  H  is  obtained  from  G  by  putting  B  r 
0.  It  turns  out  that  estimators  of  £  equivariant  under  H  are  of  the  form  £  = 
W-fw’  where  S  =  ww',  W:p  *  p  is  nonsingular,  and  V  ■  V(UU’):p  «  p  with  U  = 
W~*X.  For  a  specified  loss  function,  the  best  choice  of  the  function  Y(') 
usually  involves  the  unknown  parameters  {  and  £  besides  being 
computationally  intractable.  This  is  also  the  case  for  estimating  £-1  and  |El 
for  which  estimators  of  the  form  £-1  =  w’_  1  V~ '  W- 1  and  |£l  -  |s|y(X’s_,X) 

respectively  are  considered  where  Y  in  the  second  case  is  a  scalar.  However, 
for  most  problems,  an  upper  bound  of  the  optimum  V,  say  V 0,  is  available. 
Because  of  the  convex  nature  of  the  loss  functions,  it  then  follows  that  given 
any  estimator  with  a  specified  V*»  putting  V  =  min(YoiV*)  results  in 
improvement  of  risk.  When  V*  and  Vo  are  matrices,  V  is  defined  as  Y  =  Vo  if 
Vo  *  V*  =  V*  otherwise,  where  Vo  *  V*  means  V*  -  Vo  is  n.n.d.  This  is  the 

basis  of  derivation  of  all  the  testimators  discussed  in  this  section. 

Remark  4.1.  It  is  interesting  to  observe  that  the  testimators  of  £,  I-1  of  |El 
defined  above  are  based  on  different  test  statistics.  For  E  and  E-t,  a  test 
based  on  the  maximum  root  of  x’s^X  iB  used,  while  for  jEl  it  is  the  likelihood 
ratio  tost.  In  all  the  cases,  the  underlying  null  hypothesis  is  (  =  0. 
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Remark  4.2.  Sequential  testimators  of  £,  I-1  and  |l|,  based  on  suitable  tests 
of  parts  of  f,  all  of  which  dominate  the  corresponding  best  affine  equivariant 
estimators  are  also  available.  See  Sinha  (1976)  and  Sinha  and  Ghosh  (1986)  for 
detail. 


5.  Open  Problems 


The  following  is  a  list  of  the  open  problems  in  the  area  discussed  in  the 
paper. 

(a)  It  will  be  interesting  to  develop  Haff-type  improved  estimators  for 
estimating  the  precision  matrix  I-1  under  the  loss  function  L3(')  in 
(1.2). 

(b)  For  estimating  the  generalized  variance  |  £  |  under  the  loss  functions 
in  (1.3),  so  far  no  Haff-type  improved  estimator  is  available.  Is  it 
true  that  the  estimators  e0p^js|  are  admissible  when  X  is  ignored? 
A  partial  answer  is  available  in  Das  Gupta  (1982). 

(c)  So  far,  Stein’s  testimators  are  available  only  for  the  loss  functions 
L,(’)  in  (1.1)  and  (1.2).  What  happens  for  other  loss  functions? 
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